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In order to analyze data produced by the kilometer-scale gravitational wave detectors that will begin 
operation early next century, one needs to develop robust statistical tools capable of extracting 
weak signals from the detector noise. This noise will likely have non-stationary and non-Gaussian 
components. To facilitate the construction of robust detection techniques, I present a simple two- 
component noise model that consists of a background of Gaussian noise as well as stochastic noise 
bursts. The optimal detection statistic obtained for such a noise model incorporates a natural veto 
which suppresses spurious events that would be caused by the noise bursts. When two detectors are 
present, I show that the optimal statistic for the non-Gaussian noise model can be approximated 
by a simple coincidence detection strategy. For simulated detector noise containing noise bursts, I 
compare the operating characteristics of (i) a locally optimal detection statistic (which has nearly- 
optimal behavior for small signal amplitudes) for the non-Gaussian noise model, (ii) a standard 
coincidence-style detection strategy, and (iii) the optimal statistic for Gaussian noise. 
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The reliable detection of weak gravitational wave sig- 
nals from broad-band detector noise from kilometer-scale 
interferometers such as LIGO Q and VIRGO Q is the 
primary concern in developing gravitational wave data 
analysis strategies. Because of the weakness of the ex- 
pected gravitational wave signals, it is critical that the 
detection strategy should be nearly optimal. However, 
the detector noise may not be purely stationary and 
Gaussian, so it is important that the detection strategy 
also be robust so that detections will be reliable. 

Until now, most work on the development of data anal- 
ysis strategies has been limited to the case of stationary 
Gaussian noise, though there has been some work || on 
the creation of vetoes that will discriminate between ex- 
pected gravitational wave signals and non-Gaussian noise 
bursts. Because the properties of the noise in the LIGO 
and VIRGO detectors will not be known in advance, it 
is difficult to assess how well these strategies and vetoes 
will perform. When real interferometer data from the 
40-meter Caltech prototype is used, it is found that ad- 
ditional vetoes are needed to deal with the abundance of 
false alarms arising from the non-Gaussian noise f4|||. 

In this paper, I present a simple non-Gaussian noise 
model, consisting of Poisson-distributed noise bursts, 
that represents a number of potential non-Gaussian noise 
sources that may be present in future interferometers. 
This noise model is less naive than the usual assumptions 
of Gaussian noise alone, but is simple enough that many 
analytical results can be obtained. By using this noise 
model, the robustness of various detection strategies can 
be assessed. A general introduction to signal detection in 
non-Gaussian noise can be found in Ref. §. The new re- 
sult in this paper is the use of the non-Gaussian noise 
model in examining the performance of simple multi- 
detector search strategies, which will be important for 
gravitational wave searches. 

I will adopt the following notation in this paper. The 



detector output is a set of N values that are written 
collectively as a vector h in a A^-dimensional vector 
space V. This vector can be thought of as having com- 
ponents hj — (h,ej), j 6 [0, iV — 1], which represent 
a time series of sample measurements made by the de- 
tector. Here, is the appropriate Cartesian basis on 
V. Alternatively, the vector can be expressed as the set 
of components {fi2k, h 2 k+i}, k e [0,N/2 — 1] — the real 
and imaginary Fourier transform components of {hj}. 
These components are given by hj — (h, e^), where 
e 2fc = (2/N) 1 / 2 Y,fj 1 e j cos(27rjk/N) and e 2fc+1 = 

(2/A0 1/2 E^o 1 e 3 sin(2wjk/N), k e [0, N/2 - 1]. Thus, 
the vectors can be treated in either a time representation 
or a frequency representation. 

A natural inner product of two vectors, a and b, in 
this vector space is defined as (a, Qb) . The kernel Q 
is the inverse of the auto-correlation matrix, R, of the 
Gaussian component, nc, of the detector noise. Thus, 
R = (nc <8> no), where the angle brackets denote an 
average over an ensemble of realizations of detector noise. 
Vector norms are defined in terms of this inner product, 
i.e., ||a|| 2 = (a, Qa), as are unit vectors: a = a/||a||. 

The detector noise n consists of two components: (i) 
the Gaussian component no (that is always present) and 
(ii) a possible noise burst component ne that is present 
with probability Pb- The Gaussian component has the 
probability distribution p [uq] oc exp(— 1 1 riQ || 2 /2) . If the 
burst component is randomly distributed in the vector 
space V with normalized measure D[iib], then the prob- 
ability distribution for the noise is 
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Suppose that the noise bursts are uniformly-distributed 
in the vector space V out to a large radius R. Such 
bursts will typically last the entire duration of interest 
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(TV samples) and fill the entire frequency band. The noise 
distribution is approximately 
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for ||n|j < R, where e ~ 2 N / 2 T(N/2 + l)R- N P B /(l- P B ) 
for large R. 

An alternative derivation of Eq. (|2|) is the following. 
Suppose the noise burst is simply an additional transient 
source of Gaussian noise. Then Eq. (@) can be written as 



p[n] oc e -(-Q")/ 2 (l + £e (n,[Q-Q»/2 ); 



(3) 



where e = (det |R|/ det \K'\) N / 2 P B /(1 - Pb), K' is the 
auto-correlation matrix for both ambient and the burst 
components of Gaussian noise together, and the matrix 
Q' is the inverse of R'. If the typical noise burst is much 
louder than the ambient Gaussian noise, then Q — Q' « 
Q. Therefore, in the case of loud Gaussian noise bursts, 
Eq. (|J) has the same form as Eq. (Q). 

The likelihood ratio can now be computed for the two 
alternative hypotheses: that the output h = n is noise 
alone (Ho), or that the output h = Aii + n is a signal Au 
of amplitude A embedded in noise (i?i).f] The likelihood 
ratio A(^4) = p[h \ H{\/p\h. \ H ] is the ratio of the poste- 
rior probability of obtaining the observed output h given 
hypothesis H\ to the posterior probability of obtaining h 
given hypothesis Ho. One then finds that the likelihood 
ratio is 



where 



and 
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A g (A) + , 
1 + a 



,A(ii,Qh)-A 2 /2 



(4) 



(5) 



(6) 



The quantity Aq(A) is the likelihood ratio one would ob- 
tain if only the Gaussian noise component were present; 
it is a monotonically increasing function of the matched 
filter (u, Qh). When a non-Gaussian noise burst compo- 
nent can occur, the likelihood ratio depends on an addi- 
tional measured quantity: the magnitude of the output 
vector, ||h||. In addition to these functions of the detector 
output, the likelihood ratio also depends on the expected 
amplitude A of the signal and the proportionality factor 
e, which encapsulates the probability of a noise burst and 
the maximum possible amplitude R of the burst. 



*In this paper, I consider only signals which are completely 
known (up to their amplitude). The generalization of this case 
to one in which the signals have an unknown initial phase is 
straightforward: see, e.g., Ref. |7T. 



The quantity a plays a central role in the modified 
likelihood ratio of Eq. (||) : it acts as a detector of noise 
bursts, and vetoes events that are more likely due to the 
burst. The logarithm of a is proportional to the amount 
of power in the detector output. For Gaussian noise 
alone, the expected value of the power is (||iig|| 2 } = A, 
and thus (In a) - \nP B - ±N\n(R 2 /N) for large N. 
Thus, for R 2 > N (as was assumed above), the value of a 
will be small. However, when a noise burst is present, the 
value of In a is increased by a typical amount ~ ^i? 2 , so a 
will typically become large. In fact, a is the excess-power 
statistic for detection of arbitrary noise bursts For 
small values of a, the likelihood ratio approaches the 
usual Gaussian noise likelihood ratio. However, for large 
values of a, the likelihood ratio approaches unity. 

The likelihood ratio is a function of the detector output 
via the two quantities x — (u, Qh) = ||h|| cos# and ||h||. 
The likelihood ratio also depends on the expected signal 
amplitude A and the factor e. Thus, the likelihood ratio 
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Figure [l] shows the likelihood ratios as functions of the 
matched filter statistic x and the angle 8 for a given value 
of e and A. Notice that the likelihood ratio is attenuated 
when the magnitude of the output, h, is much larger than 
the largest expected signal; this attenuation occurs at 
smaller signal-to-noise ratios for smaller absolute values 
of the direction cosine cos 8. 

Because of the non-trivial dependence of the likeli- 
hood ratio on the expected signal amplitude, the opti- 
mal statistic for signals of some amplitude A will not 
be the same statistic as the optimal statistic for a dif- 
ferent amplitude A'. This situation is unlike the case 
of purely Gaussian noise, for which the likelihood ratio 
grows monotonically with the matched filter output, re- 
gardless of the expected signal amplitude. Because the 
amplitude of a signal will not typically be known in ad- 
vance, it is useful to consider a locally optimal statistic 
Q , which provides the optimal performance in the limit 
of small amplitude signals. The rationale behind such a 
choice is that large amplitude signals pose no challenge 
for detection, so any reasonable statistic will suffice for 
them. The locally optimal statistic, which is defined as 
A = din A/(1A\a=o = (u, — Vlnp[h]), is given by 



A 
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where x = (u, Qh) is the matched filter. Notice that 
the locally optimal statistic also incorporates a veto 
based on the value of a = eexp[(a;sec#) 2 /2]. The lo- 
cally optimal statistic grows approximately linearly for 
x < (—2 In e) 1 / 2 cos 8, at which point it is effectively cut 
off. This is the same general behavior as was seen before 
for the likelihood ratio A(A), but there is no longer any 
scaling with a prior amplitude. 
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Another approach is to use a prior distribution f(A) 
for the amplitude, if such a distribution is known, to con- 
struct an integrated likelihood ratio A = J A(A)f(A)dA; 
this integrated likelihood ratio would then be the optimal 
statistic for detection of all potential signals. Even if the 
prior distribution is not known, it is possible to obtain an 
approximate likelihood ratio if f(A) is a slowly- varying 
function. One can show that A G = j A G (A)f(A)dA w 
f(x) exp(x 2 /2). A further approximation neglects the 
slowly-varying function entirely; one then obtains Aq ~ 
exp(x 2 /2). Thus, 

x 2 /2 , (xscc6) 2 /2 

A«- m ■ (9) 

\ _j_ ee (xsec6) 2 /2 ' 

It is important to generalize the above analysis to 
the case in which there are multiple detectors. Such 
a situation is deemed to be essential for gravitational 
wave searches since a coincident detection is an impor- 
tant corroboration that a signal is of an astronomical 
origin rather than internal to the detector. As a sim- 
ple model|] consider two gravitational wave detectors 
with identical but independent noise (including possible 
noise bursts) that also have the same response to gravita- 
tional wave signals. If the noise was purely Gaussian, the 
likelihood ratio for the two detectors together would be 
Aq B = Aq x Aq, i.e., the product of the likelihood ratios 
for Gaussian noise in each separate detector. The re- 
sulting likelihood ratio is thus an monotonically increas- 
ing function of the sum of the matched filter values in 
each detector: In Aq B ~ x A + x B where x A = (u, Q A h A ) 
is the signal-to-noise ratio in the first detector (A) and 
x B = (u, Q B h B ) is the signal-to- noise ratio in the second 
detector (B). If the likelihood ratios are integrated over 
a slowly-varying prior distribution of possible signal am- 
plitudes, then one finds lnAQ B ~ (x A + x B ) 2 , which also 
depends on the sum of the two detectors' matched filter 
values. (In this case, it is important to integrate over 
the distribution of amplitudes after multiplying the two 
detector's likelihood ratios together since the same signal 
amplitude should be present in each detector.) Thus, one 
would decide that a signal was present if the sum of the 
matched filter values in the two detectors exceeded some 
threshold. An alternative strategy would decide that a 
signal was present only if each of the matched filter val- 
ues exceeded some threshold, i.e., one thresholds on the 
quantity min(a; A , x B ) rather than on x A + x B . 

Suppose the likelihood ratios for both detectors, 
A A (x A ,6» A ) and A B (ir B ,6l B ), are given by Eq. (g). Then, 
the likelihood ratio for the combined detector is given 
by the product of A A and A B . Figure |2| shows a plot 
of contours of constant likelihood ratio as a function of 
the two measured signal-to-noise ratios for 8 A = 9 B = 
and 8 A — 0, 9 B = n/6. At low signal-to-noise ratios, the 



+ A more general model is being considered by Finn [fiffl 



contours are approximately lines of constant x A + x B — in 
this regime, the likelihood ratio behaves like the Gaussian 
likelihood ratio. However, for large values of either x A 
or x B , the contours are approximately lines of constant 
min(:r A , x B ). 

The locally optimal two-detector statistic is A AB = 
A A + A B , where A A and A B both have the form of Eq. (j|). 
This statistic also depends on x A , x B , 8 A , and 9 B . As 
before, the contours of constant A AB for fixed 9 A and 9 B 
are lines of constant x A + x B for low signal-to-noise ratios 
but are lines of constant min(x A , a; B ) if the signal-to-noise 
ratio in one of the detectors is large. 

These results somewhat justify the use of the minimum 
statistic min(x A ,a; B ) when there are two detectors with 
independent noise operating. When one of the signal-to- 
noise ratios is large but the other is moderate, the mini- 
mum statistic will give approximately the same value as 
the locally optimal statistic. In this case, the effect of the 
a-based veto in the locally optimal statistic is mimicked 
in the minimum statistic. The locally optimal statis- 
tic is better able to deal with the case when there are 
noise bursts present in both detectors, but this case oc- 
curs with probability Pj, so it should be extremely rare. 
Moreover, the minimum statistic is less powerful than the 
locally optimal statistic for small signal-to-noise ratios in 
both detectors. In this case, the locally optimal statistic 
is approximately x A + x B , which is the same as the op- 
timal statistic for Gaussian noise. However, the optimal 
statistic for Gaussian noise may be unsuitable for the 
noise model I have considered because the false alarm 
probability cannot be reduced below ~ P-q for reason- 
able thresholds. (The minimum statistic suffers the same 
problem but around the much lower probability P§.) 

To see the relative performance of these statistics, it is 
useful to consider the operational characteristics of the 
tests. For some fixed false alarm probability Q , the op- 
erational characteristic is the detection probability Q\ as 
a function of signal amplitude. I have computed these 
curves using Monte Carlo techniques. The noise model I 
used corresponds to that of Eq. (Q) with burst probability 
Pb = 1% and maximum burst amplitude R — 25. I fixed 
the false alarm probability to Qo = 1CP 3 and I exam- 
ined a vector space dimension of N = 4. Figure ||| shows 
the relative performances of the three statistics in terms 
of the detection probability as a function of signal ampli- 
tude. One sees that the locally optimal statistic performs 
the best for small amplitude signals (as expected), but 
that it has poor performance for large amplitude signals. 
This is because the large amplitude deviations are inter- 
preted as noise bursts and are suppressed. (If the locally 
optimal statistic were used in a real search, these large 
events would not be rejected outright but would rather 
be subjected to further scrutiny; thus the attenuation of 
the locally optimal statistic for large signal amplitudes 
is somewhat misleading.) The sum of the signal-to-noise 
ratio, which is optimal statistic for Gaussian noise, is 
seen to have poor performance. The reason is that be- 
cause the false alarm probability is much smaller than the 
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burst probability, the threshold required for this statis- 
tic becomes unreasonably large. However, since the false 
alarm probability is much higher than the burst probabil- 
ity squared, the minimum statistic performs reasonably 
well for both small and large amplitude signals. 

The conclusion that can be drawn from this analysis is 
that search strategies based on a Gaussian noise model 
can be significantly improved by considering a more real- 
istic non-Gaussian noise model, which may contain noise 
bursts that occur relatively frequently. If a method can 
be devised to detect these noise bursts and veto data that 
contains a noise burst, then the use of such a veto ef- 
fectively reproduces the locally optimal strategy for the 
noise model containing bursts. However, the necessary 
veto may not be easily found since it may not be possible 
to determine the non-Gaussian properties of the detector 
noise to sufficient accuracy. A viable alternative when 
two detectors are operating is to use a coincidence strat- 
egy of detection. Since the minimum statistic used in 
the coincidence strategy is a robust statistic, one would 
expect that it should have good performance for any re- 
alistic noise model. 
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ful comments on this paper. This work was supported 
by the National Science Foundation grant PHY-9424337. 
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FIG. 1. Likelihood ratio as a function of the matched fil- 
ter signal-to-noise ratio x = (u, Qh) and the direction cosine 
cosf? = (u, Qh). The model signal and noise burst both lie 
in an A = 4 dimensional vector space V . The model signal 
has an amplitude A = 5. Noise bursts occur with a prob- 
ability Pb = 1%; the possible burst vectors are uniformly 
distributed in V out to a maximum radius R — 25. The thin 
solid line is the likelihood ratio if no noise bursts were present 
(e = 0), while the thick dotted lines are plots of Eq. (fjj) with 
e — 2 x 1CP 7 (corresponding to the above parameters). 
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FIG. 2. Contours of constant likelihood ratio for two de- 
tectors, A and B, measuring signal-to-noise ratios x A and x B . 
The combined likelihood ratio is the product of individual de- 
tector likelihood ratios, which are of the form of Eq. ^ with 
A = 5 and e = 2 x 10 -7 . The top plot represents the case 
when B A = 8 B = while the bottom plot represents the case 
when 9 A — and 6 B — tt/6. Notice that the right side (be- 
yond x B ~ 5) of the bottom plot is compressed compared 
to the top plot, while the left hand side of the two plots are 
roughly the same. 




signal amplitude A 

FIG. 3. The operation characteristics, which express the 
detection probability Q\ as a function of signal amplitude 
A for fixed false alarm probability Qo — 10 -3 for three 
multi-detector statistics. The "sum" statistic adds the sig- 
nal-to-noise ratios of the two detectors; this is the optimal 
statistic for purely Gaussian noise. The "min" statistic is 
simply the minimum of the two signal-to-noise ratios. The 
"opt" statistic is the locally optimal statistic for the noise 
model. The noise model has a burst probability of Pb = 1%, 
a maximum burst amplitude of R = 25, and a vector space 
dimension iV = 4. 
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